AMENDMENT AND RESPONSE UNDER 37 CFR § 1.116- EXPEDITED PROCEDURE 

Serial Number: 09/899,424 
Filing Date: July 3, 2001 

Title: DISTRIBUTION THEORY BASED ENRICHMENT OF SPARSE DATA FOR MACHINE LEARNING 



IN THE CLAIMS 

Please amend the claims as follows. 

1 . (Currently Amended) A computer-implemented method for enriching sparse data for 
machine learning, comprising: 

receiving the sparse dat a, wherein the received data comprises data selected from the 
group consisting of static data and real-time data ; 

if the received data is static data, then reading a sample of the received static data using a 
predetermined window length; and 

if the received data is real-time data, then reading a sample of the received real-time data 

using a dynamically varying window of predetermined window length; 

enriching the received data around a deviation of the mean of the received data using a 
predetermined distribution; and 

outputting the enriched data for unbiased learning and improved performance during the 
machine learning. 

2. (Original) The method of claim 1, wherein machine learning comprises: 
supervised artificial neural network learning. 

3. (Original) The method of claim 1, further comprising: 
checking the received data for sparseness; and 

enriching the checked data around the deviation of the mean of the received data based 
on the outcome of the checking. 

4. (Original) The method of claim 1, wherein checking the received data further comprises: 
comparing the received data with a predetermined number. 



Page 2 

Dkt: H0002101 US 



5. 



(Original) The method of claim 4, wherein enriching the received data further comprises: 
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enriching the received data around the deviation of the mean of the received data based 
on the outcome of the comparison. 

6. (Original) The method of claim 1, further comprising: 
rearranging the received data based on class. 

7. (Original) The method of claim 6, further comprising: 

normalizing the rearranged data based on attributes in the rearranged data. 

8. (Original) The method of claim 6, further comprising: 

checking each class of data in the rearranged data for sparseness; and 
enriching each class of data around a deviation of the mean associated with the respective 
class based on the outcome of the checking. 

9. (Original) The method of claim 8, wherein checking each class of data further 
comprises: 

comparing each class of data to a predetermined number. 

10. (Original) The method of claim 9, wherein enriching each class of data comprises: 
enriching each class around a deviation of the mean associated with the respective class 

based on the outcome of the comparison. 

1 1 . (Original) The method of claim 10, wherein enriching each class around a deviation of 
the mean associated with the respective class further comprises: 

computing the mean and standard deviation for each class of data in the rearranged data; 

and 

generating additional data for each class using the associated computed mean and 
standard deviation. 
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12. (Original) The method of claim 11, wherein generating additional data further 
comprises: 

generating additional data between limits computed using the equation: 
x ± ka 

wherein x is the computed mean associated with each class, A: is a constant varying 
between 0.25 to 3, and a is the computed standard deviation associated with each class. 

13. (Cancelled) The method of claim 12, wherein the predetermined distribution further 
comprises: 

arranging the enriched data using the equation: 
[X mn J[WJ=[BJ 

wherein W is a weight matrix, X is input patterns, and Z?, f s are the classes; and 
rearranging in the max-min-max pattern: 
Let for class / 

(RX} N - Rx 2 n) > (R*2N ~ R*3n) > • • . > (RX(i-I)N-RXiN) ~ 
(RX(j+2)N- RX(i+l)N) < (RX(i+3) ~ RX(i+2)) < < (I&AN ~ RX(A-I)n) 

where Rxj N ^ Row xjm are enriched data values. 

14. (Currently Amended) A computer-implemented method for enriching sparse data for 
machine learning, comprising: 

receiving the sparse data; 

enriching the received data around a deviation of the mean of the received data using a 

predetermined distribution The m e thod of claim 1 . wherein the predetermined distribution 
comprises distributions selected from the group consisting of normal distribution, exponential 
distribution, logarithmic distribution, chi-square distribution, t-distribution, and F-distribution; 
and 

outputting the enriched data for unbiased learning and improved performance during the 

machine learning . 
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15. (Cancelled) The method of claim 1, wherein the received data comprises data selected 
from the group consisting of static data and real-time data. 

16. (Cancelled) The method of claim 15, further comprising: 

if the received data is static data, then reading a sample of the received static data using a 
predetermined window length; and 

if the received data is real-time data, then reading a sample of the received real-time data 
using a dynamically varying window of predetermined window length. 

17. (Currently Amended) The method of claim 1[[16]], further comprising: 

if the received data is real-time data, then repeating the reading of the sample of the 
received real-time data using a dynamically varying window of predetermined window length. 

18. (Currently Amended) A computer readable medium having computer-executable 
instructions for performing a method of machine learning when only sparse data is available, 
comprising: 

enriching the sparse data around a deviation of the mean of the received data using a 
predetermined distribution selected from the group consisting of normal distribution, exponential 
distribution, logarithmic distribution, chi-square distribution, t-distribution, and F-distribution ; 
and 

outputting the enriched data for unbiased machine learning. 

19. (Original) The computer readable medium of claim 18, wherein machine learning 
comprises: 

supervised artificial neural network learning. 

20. (Original) The computer readable medium of claim 18, further comprising: 
checking the received data for sparseness; and 

enriching the received data around the deviation of the mean of the received data based 
on the outcome of the checking. 
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21. (Original) The computer readable medium of claim 18, wherein checking the received 
data further comprises: 

comparing the received data with a predetermined number. 

22. (Original) The computer readable medium of claim 21, wherein enriching the received 
data further comprises: 

enriching the received data around the deviation of the mean of the received data based 
on the outcome of the comparison. 

23. (Original) The computer readable medium of claim 18, further comprising: 
rearranging the received data based on class. 

24. (Original) The computer readable medium of claim 23, further comprising: 
normalizing the rearranged data based on attributes in the rearranged data. 

25. (Original) The computer readable medium of claim 23, further comprising: 
checking each class of data in the rearranged data for sparseness; and 

enriching each class of data around a deviation of the mean associated with the respective 
class based on the outcome of the checking. 

26. (Original) The computer readable medium of claim 25, wherein checking the each class 
of data further comprises: 

comparing each class of data to a predetermined number. 

27. (Previously Presented) The computer readable medium of claim 26, wherein enriching 
the each class of data comprises: 

enriching each class around a deviation of the mean associated with the respective class 
based on the outcome of the comparison. 
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28. (Original) The computer readable medium of claim 27, wherein enriching each class 
around a deviation of the mean associated with the respective class further comprises: 

computing the mean and standard deviation for each class of data in the rearranged data; 

and 

generating additional data for each class using the associated computed mean and 
standard deviation. 

29. (Original) The computer readable medium of claim 28, wherein generating additional 
data further comprises: 

generating additional data between limits computed using the equation: 
x ± ka 

wherein x is the mean associated with each class, & is a constant varying between 0.25 
to 3, and a is the standard deviation associated with each class. 

30. (Cancelled) The computer readable medium of claim 29, wherein the predetermined 
distribution further comprises: 

arranging the enriched data using the equation: 
[X mn ] [W]=[BJ 

wherein W is a weight matrix, X is input patterns, and Bi's are the classes; and 
rearranging in the max-min-max pattern: 
Let for class / 

(Rxjm - RX2N) > (R*2N ~ R*3n) > • • • > (R x (i-l)N -R^In) ~ 
(RX(i+2)N~ RX(i+I)N) < (RX(i+3) ~ &K(i+2)) < • ■ < (R*AN ' RX(A-1)n) 

where Rxi N ^ Row x^. 

wherein Rx }N is the first row of the Xth (reference) class consisting of TV features, Rx2n is 
the second row of t\iz Xth (reference) class consisting of TV features, and so on. 

3 1 . (Cancelled) The method of claim 1 8, wherein the predetermined distribution comprises 
distributions selected from the group consisting of normal distribution, exponential distribution, 
and logarithmic distribution. 
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32. (Original) The computer readable medium of claim 18, wherein the received data 
comprises data selected from the group consisting of static data and real-time data. 

33. (Original) The computer readable medium of claim 32, further comprising: 

if the received data is static data, then reading a sample of the received static data using a 
predetermined window length ; and 

if the received data is real-time data, then reading a sample of the received real-time data 
using a dynamically varying window of predetermined window length. 

34. (Original) The computer readable medium of claim 33, further comprising: 

if the received data is real-time data, then repeating the reading of the sample of the 
received real-time data using a dynamically varying window of predetermined window length. 

35. (Currently Amended) A computer system for a machine learning in a sparse data 
environment, comprising: 

a storage device; 

an output device; and 

a processor programmed to repeatedly perform a method, comprising: 
receiving the data; 

enriching the received data around a deviation of mean of the received data using 
a predetermined distribution selected from the group consisting of normal distribution, 
exponential distribution, logarithmic distribution, chi-square distribution, t-distribution, 
and F-distribution ; and 

outputting the enriched data for unbiased machine learning. 

36. (Original) The system of claim 35, wherein machine learning comprises: 
supervised artificial neural network learning. 



37. 



(Original) The system of claim 35, further comprising: 
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rearranging the received data based on class. 

38. (Original) The system of claim 37, further comprising: 
normalizing the rearranged data based on attributes in the rearranged data. 

39. (Original) The system of claim 37, further comprising: 
checking each class of data in the rearranged data for sparseness; and 
enriching each class of data around a deviation of the mean associated with the respective 

class based on the outcome of the checking. 

40. (Original) The system of claim 39, wherein checking the each class of data further 
comprises: 

comparing each class of data to a predetermined number. 

41. (Original) The system of claim 40, wherein enriching each class of data comprises: 
enriching each class around a deviation of mean associated with the respective class 

based on the outcome of the comparison. 

42. (Original) The system of claim 41, wherein enriching the each class around a deviation 
of mean associated with the respective class further comprises: 

computing the mean and standard deviation for each class of data in the rearranged data; 

and 

generating additional data for each class using the associated computed mean and 
standard deviation. 

43. (Original) The system of claim 42, wherein generating additional data further comprises: 
generating additional data between limits computed using the equation: 

x ± ka 

wherein x is the mean associated with each class, A: is a constant varying between 0.25 
to 3, and a is the standard deviation associated with each class. 
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44. (Cancelled) The system of claim 35, wherein the predetermined distribution comprises 
distributions selected from the group consisting of normal distribution, exponential distribution, 
and logarithmic distribution. 

45. (Currently Amended) A computer-implemented system for machine learning in a sparse 
data environment, comprising: 

a receive module to receive sparse data; 

a database coupled to the receive module to receive and store sparse data; and 

a unique numeric transformation module coupled to the database to extract words from 

text stored in the database and to transform each of the extracted words into a unique numerical 
representation; 

an analyzer to enrich the received data around a deviation of the received data using a 
predetermined distribution; and 

an output module coupled to the analyzer to output the enriched data for unbiased 
learning and increased performance during machine learning. 

46. (Original) The system of claim 45, further comprising: 

a database coupled to the receive module to receive and store sparse data. 

47. (Original) The system of claim 45, wherein machine learning comprises: 
supervised artificial neural network learning. 

48. (Original) The system of claim 45, further comprising: 

a comparator coupled to the analyzer to check the received data for sparseness, wherein 
the analyzer enriches the checked data around the deviation of the mean of the received data 
based on the outcome of the checking. 

49. (Original) The system of claim 48, wherein the comparator checks the received data for 
sparseness by comparing the received data with a predetermined number. 
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50. (Original) The system of claim 49, wherein the analyzer enriches the received data 
around the deviation of the mean of the received data based on the outcome of the comparison. 

5 1 . (Original) The system of claim 50, wherein the analyzer rearranges the received data 
based on class. 

52 (Original) The system of claim 51, wherein the analyzer normalizes the rearranged data 
based on attributes in the data. 

53. (Original) The system of claim 51, wherein the analyzer checks each class of data for 
sparseness, and enriches each class around a deviation of the mean associated with the class 
based on the outcome of the checking by the analyzer. 

54. (Original) The system of claim 53, wherein the comparator compares each class in the 
rearranged data with a predetermined number and wherein the analyzer enriches each class 
around a deviation of the mean associated with the class based on the outcome of the comparison 
by the comparator. 

55. (Original) The system of claim 54, wherein the analyzer enriches data in each class by 
computing a mean and standard deviation for each class in the rearranged data, and the analyzer 
further generates additional data for each class based on the respective computed mean and 
standard deviation. 

56. (Original) The system of claim 55, wherein the analyzer generates additional data 
between limits computed using the equation: 

x ± k<J 

wherein x is the mean associated with each class in the rearranged data, k is a constant 
varying between 0.25 to 3, and a is the standard deviation associated with each class in the 
rearranged data. 
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57. (Cancelled) The system of claim 56, wherein the analyzer further computes additional 
data using the equation: 

[X«uJ[WJ=[BJ 

wherein W is a weight matrix, X is input patterns, and B x 's are the classes; and 
rearranging in the max-min-max pattern: 
Let for class i 

(RxiN ~ R*2n) > (R*2N ~ R*3n) > • • ■ >(RX(i-l)N -R^in) ~ 
(RX(i+2)N~ RX(i+l)N) < (RX(i+3) - RX(i+2)) < < (R*AN ~ RX(A-I)n) 

where Rxjn ^ Rowxj^. 

58. (Currently Amended) A computer-implemented system for machine learning in a sparse 
data environment, comprising: 

a receive module to receive sparse data. Th e system of claim 4 5, wherein the received 

data comprises data selected from the group consisting of static data and real-time data ; 

a reading module coupled to the receive module that reads a sample of the received data 
having a predetermined window length; 

an analyzer to enrich the received data around a deviation of the received data using a 

predetermined distribution; and 

an output module coupled to the analyzer to output the enriched data for unbiased 

learning and increased performance during machine learning . 

59. (Cancelled) The system of claim 58, further comprising: 

a reading module coupled to the receive module reads a sample of the received data 
having a predetermined window length. 

60. (Cancelled) The system of claim 59, wherein the reading module reads the sample of the 
received data using a predetermined window length when the read data is static data, and reads a 
sample of the received data using a dynamically varying window of predetermined window 
length when the read data is real-time data. 
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61 . (Currently Amended) The system of claim 58 [[60]], wherein the reading module repeats 
the reading of the sample of the received data using a dynamically varying window of 
predetermined window length when the received data is real-time data. 

62. (Cancelled) The system of claim 45, further comprising: 

a database coupled to the receive module to receive and store sparse data; and 

a unique numeric transformation module coupled to the database to extract words from 

text stored in the database and to transform each of the extracted words into a unique numerical 

representation. 



